Disambiguating prepositional phrase attachment sites with sense information captured in contextualized distributional data
نویسنده
چکیده
This work presents a supervised prepositional phrase (PP) attachment disambiguation system that uses contextualized distributional information as the distance metric for a nearest-neighbor classifier. Contextualized word vectors constructed from the GigaWord Corpus provide a method for implicit Word Sense Disambiguation (WSD), whose reliability helps this system outperform baselines and achieve comparable results to those of systems with full WSD modules. This suggests that targeted WSD methods are preferable to ignoring sense information and also to implementing WSD as an independent module in a pipeline.
منابع مشابه
Improving Disambiguation of Prepositional Phrase Attachments Using the Web as Corpus*
The problem of disambiguating Prepositional Phrase (PP) Attachments consists in determining if a PP is part of a Noun Phrase, as in He sees the room with books, or an argument of a verb, as in He fills the room with books. Volk has proposed two variants of a method that queries an Internet search engine to find the most probable Prepositional Phrase attachment. In this paper we apply the latest...
متن کاملCombining Dependency Parsing with PP Attachment
Prepositional phrase (PP) attachment is one of the major sources for errors in traditional statistical parsers. The reason for that lies in the type of information necessary for resolving structural ambiguities. For parsing, it is assumed that distributional information of parts-of-speech and phrases is sufficient for disambiguation. For PP attachment, in contrast, lexical information is needed.
متن کاملPrepositional Phrase Attachment Disambiguation using WordNet
In this thesis we use a knowledge-based approach to disambiguating prepositional phrase attachments in English sentences. This method was first introduced by S. M. Harabagiu. The Penn Treebank corpus is used as the training text. We extract 4-tuples of the form [ V P , NP1, Prep, NP2 ] and sort them into classes according to the semantic relationships between parts of each tuple. These relation...
متن کاملAnalyzing Human and Machine Performance In Resolving Ambiguous Spoken Sentences
Written sentences can be more ambiguous than spoken sentences. We investigate this difference for two different types of ambiguity: prepositional phrase (PP) attachment and sentences where the addition of commas changes the meaning. We recorded a native English speaker saying several of each type of sentence both with and without disambiguating contextual information. These sentences were then ...
متن کاملA Nearest-Neighbor Method for Resolving PP-Attachment Ambiguity
We present a nearest-neighbor algorithm for resolving prepositional phrase attachment ambiguities. Its performance is significantly higher than previous methods that were tested on the same data set. We will also show that the PP-attachment task provides a way to evaluate measures of distributional word similarities. Our experiments indicate that the cosine of pointwise mutual information vecto...
متن کامل